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Abstract —This work explores the rate-reliability-complexity 
limits of the quasi-static ft'-user multiple access channel (MAC), 
with or without feedback. Using high-SNR asymptotics, the work 
first derives bounds on the computational resources required to 
achieve near-optimal (ML-based) decoding performance. It then 
bounds the (reduced) complexity needed to achieve any (including 
suboptimal) diversity-multiplexing performance tradeoff (DMT) 
performance, and finally bounds the same complexity, in the 
presence of feedback-aided user selection. This latter effort 
reveals the ability of a few bits of feedback not only to improve 
performance, but also to reduce complexity. In this context, the 
analysis reveals the interesting finding that proper calibration 
of user selection can allow for near-optimal ML-based decoding, 
with complexity that need not scale exponentially in the total 
number of codeword bits. The derived bounds constitute the 
best known performance-vs-complexity behavior to date for ML- 
based MAC decoding, as well as a first exploration of the 
complexity-feedback-performance interdependencies in multiuser 
settings. 

I. Introduction 
A. Multiple access system model 

We consider a symmetric multiple access channel (MAC) 
with K single-antenna users, communicating to a receiver with 
n r receiving antennas, over a quasi-static fading channel. Each 
user i, i = 1,2,..., K communicates over the same duration 
of T time slots, while the receiver accumulates an n r x T 
signal matrix Y taking the form 

K 

Y = v'SNR ^ hi + W (1) 

i =1 

where hi ~ CAf ( 0 . ) is the ith user channel vector 

with independent identically distributed (i.i.d.) zero-mean unit- 
variance CA f (0, 1) complex Gaussian entries, where W is 
the received noise matrix with i.i.d. CA/"(0,1) entries, where 
SNR denotes the signal to noise ratio, and where x t = 
[xip ■ ■ ■ aft,T] T is a scaled version of a T-length code vector 
of user i satisfying an average power constraint IE |||| 2 < 

T, i = K. 

1) Exploring the scenario of outage-limited communica¬ 
tions with bounded computational resources: We here con¬ 
sider the outage-limited MAC setting, where the channel in 
0 is randomly drawn but it remains fixed throughout the 
coding duration of T channel uses. This common assumption 
corresponds to scenarios where the channel changes slowly, 
and where communication takes place under strict latency 
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constraints that do not allow for encoding over a large number 
of fading realizations. 

This same outage-limited setting can often experience re¬ 
duced reliability due to the event of outage where the in¬ 
stantaneous channel cannot support the user rates, and due to 
the fact that in cases like the MAC, the maximum allowable 
rates are diminished by interference. Moreover, to meet the 
strong demands for faster data rates on wireless and cellular 
channels, most communications systems would opt to operate 
closer and closer to the theoretical limits of capacity, at the 
further expense of reliability. This in turn, naturally forces the 
need for high-performance transceivers that will not further 
sacrifice reliability performance, but which can often be com¬ 
putationally expensive, or even computationally prohibitive. 

What happens though to this performance if communica¬ 
tion takes place under strict constraints on the computational 
resources that can be used to encode and decode? Equiva¬ 
lently, one can ask, what computational resources are needed 
to achieve a certain rate-reliability performance. The main 
objective of our work is to provide some understanding of such 
complexity-performance interdependencies that are crucial in 
the MAC. 

2) Complexity-performance interdependencies in the MAC.: 
Naturally there are many such interdependencies between the 
key parameters of SNR, rate, computational complexity and 
reliability. For example it is easy to imagine that, typically, 
decreasing computational resources can potentially increase 
the probability of error by placing a constraint on the coding 
duration T, by forcing the use of less complex transceivers 
with suboptimal performance, or even by requiring that the 
decoding effort be terminated early (computational halt) at the 
risk of additional errors. Similarly, increased user rates can 
mean larger and denser codes, with more errors and larger 
decoding algorithmic efforts. Along the same lines, fixing the 
rate and increasing the SNR, will make the codewords sparser 
and thus possibly easier to differentiate and decode. 

B. Complexity-performance measures and high-SNR approxi¬ 
mations 

Let iV tnax denote the amount of computational reserves, in 
floating point operations (flops) per T channel uses, that the 
system is endowed with, in the sense that after Aj nax flops, the 
decoder must simply terminate, potentially prematurely and 
before completion of the task, thus declaring an error. Also 
let P e be the probability of error associated to the decoder, 
in the presence of the aforementioned computational con¬ 
straints. Motivated by the need for reduced-complexity high- 
performance ML-based decoders (cf. EE®)’ we explore 
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the properties of this quadruple (SNR, R, P e . Aj nax ), for ML- 
based MAC decoding. 

1) High SNR analysis of performance and complexity: 
Towards making sense of the complexity-vs-performance in¬ 
terdependencies, we will exploit carefully-chosen asymptotic 
bounds, where analytical tools can rigorously and concisely 
approximate these interdependencies, allowing for cleaner 
insights that can yield impact. Such asymptotic bounds (gen¬ 
erally mapping a random problem to its near-deterministic 
limits), help to identify where the largest gaps in our under¬ 
standing may lie. We will here focus on high-SNR asymptotic 
bounds but note that the derived approximations can offer in¬ 
sights in operational regimes of moderate-to-high SNR values. 
Such emphasis on the moderate-to-high SNR values, can better 
capture the core of the complexity-vs-performance problem 
because such operational regions can eventually support higher 
rates, corresponding to larger codebooks that are typically — 
but not always, as we will surprisingly see later on — harder 
to decode. 

Towards this, we let each user’s rate scale as R = 
r log 2 SNR, where r denotes the multiplexing gain |6J de¬ 
scribing the density of the constellation, and serving as a 
measure of how far each user’s rate is from the capacity of an 
AWGN channel. Each user i employs a code X rl of cardinality 
\X ri \ = 2 rt = SNR rT , and the receiver uses a decoder V r , 
which is considered to make an error if any of the K users’ 
messages is not decoded correctly. We restrict our attention to 
the class of lattice codes and joint ML decoders, which we 
describe in Section lEDl that follows. 

2) Complexity and reliability exponents: For complexity 
analysis we adopt the high-SNR approach in 111. (2), where 
for some r, for some encoders | X r , } A,, and for a decoder 
T> r , the complexity exponent c{r ) was defined to be 


c(r) 


lim log H max { t ) 
SNR—Too log SNR 


( 2 ) 


We also adopt the well-known DMT approach of (6j where, 
in the same high SNR regime, the diversity gain d(r) takes 
the form 


d{r) 


lim 

SNR—>-oo 


log P e (r) 

log SNR ' 


(3) 


Using this asymptotic approach, the work in @ has shown 
that the optimal A'-user MAC DMT performance, takes the 
form 


<4ac0) 


n r (l-r), if 0 <r< 1 ?f- i , 
d*K,n r (Kr), 


(4) 


where d* mn {r ) denotes the optimal DMT of an (m x n) 
MIMO channel (cf. © for its exact characterization). Our first 
effort here will be to bound the complexity exponent c(r) that 
can guarantee any MAC-DMT performance d(r) < d* lac (r). 
Before we do that, let us try to get some insight on these two 
competing exponents. 

3) Insight on d(r) and c(r): To gain some insight, we note 
that in terms of the MAC-DMT performance, the above ex¬ 
pression in 0 reflects the existence of two distinct r regions; 
the lightly-loaded multiplexing gain region 0 < r < ppfi 
where the MAC exhibits single-user behavior as if there 


were no multiuser interference, and the heavily-loaded region 
jf'W < r < n f: , where the MAC exhibits an antenna-pooling 
behavior. 

In terms of the complexity exponent c(r), it is easy to see 
that in this MAC setting, there is a total of 2 RKT = SSR rKT 
codewords (jointly from all users), which means that a brute- 
force joint ML decoder would always make 2 RK1 codeword 
visits [U Thus in terms of exponents, this means that such 
a brute-force optimal ML decoder, can achieve the optimal 
^mac( r ) with a required complexity exponent c(r) = rKT that 
is indeed the maximum (meaningful) complexity exponent in 
this setting] We will show that a properly designed sphere 
decoder can achieve this same ML performance, with a 
reduced complexity exponent c(r) < rKT. To give the reader 
an idea, we can recall from the work in jl j that, if for example, 
there is only one user (I\ = 1) who has nt transmit antennas, 
and the receiver has n r > nt antennas, then — again in the 
high SNR limit — this same ML performance can be achieved 
with computational resources that are substantially smaller 
than brute force, and which corresponded to an (effective) 
complexity exponent that was a iion-monotonicj^] piece-wise 
linear function in r, of the form 

c(r) = rT ^1 - , r = 0 , 1 , 2 ,..., n t . 

We will do something similar here, for the multiuser case I\ > 
1, and we will also explore the effect of a few bits of feedback 
on the d(r) and c(r). 


C. Notation and assumptions 

Following ©, we use = to denote the exponential equality, 
i.e., a function /(SNR) is said to be /(SNR) = SNR 6 if 
and only if limsNR->.oo ^logSNtf ^ = Exponential inequalities 
such as <, > are similarly defined. By s = \x] we mean 
the smallest integer s > x, and by t = [x\ we mean the 
largest integer t < x. Capital boldface letters are reserved for 
matrices, and the lower-case boldface ones are for column 
vectors. A is the Hermitian transpose of matrix A, and 
( x) + := max{x,0}. Finally, we define v := min{A', n r }. 

In terms of assumptions, we consider fading coefficients 
that have a circularly symmetric complex-normal distribution 
and which are i.i.d. in space. As stated, the coding duration is 
smaller than the coherence interval of the channel, and hence 
the fading is randomly drawn, but it is held fixed throughout 
the communication process. We will assume that the receiver 
has full channel state information, i.e., knows completely the 
channel vectors hi of every user i, while the users have no 
knowledge of hi in the case of no feedback. Furthermore, 

'The number of codeword visits is — in the high SNR setting — in the 
same order as the number of computational flops. 

Considering decoders with higher complexity than brute-force ML, is 
unnecessary because ML decoders are already optimal. 

3 This non monotonic expression is maximum somewhere in the mid-ranges 
of r. suggesting that the computational resources to achieve the asymptotically 
optimal complexity need not necessarily increase with increasing rate. For 
example, for the simple case of n t = 2, in the presence of the minimum 
required T = nt = 2 in order to achieve the optimal DMT of (r), the 
complexity exponent was increasing as c(r) = r for 0 < r < 1. and was 
then decreasing as c(r) = 2 — r for 1 < r < 2. 
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the rate of communication R is kept constant and thus does 
not change as a function of the channel and of the feedback. 
Finally, different assumptions regarding the structure of the 
code and decoder, will be presented immediately below in 
Section [kP] Assumptions on the feedback-aided user-selection 


algorithm will be presented in Sec. Ill 


D. Lattice encoders and joint decoder 

Directly from {!]) we have the joint real-valued vectorized 
model which takes the form 

y = s/SNRHx + w (5) 


where 


where 


H = It ® 


Rejtfeq} 
Im { H cq } 


-Im{Heq} 

Re{Heq} 


H e q — [h\ • • • hx[ 


( 6 ) 


represents the entire (n r x K) channel fading matrix, where 


x = [Re{xi t i} Imj^ij} • • • Rej^^i} Im {xk,i} 

Re{a;i !2 }Im{xi i2 } Reject, T } lm{x K ,T}} T 

is the joint codeword vector aggregated over all users, and 
where y and w are defined similarly to x. The joint codeword 
x is decoded using a joint ML-based decoder which, in 
its brute-force form, is MAC DMT optimal (cf. 0, ®), 
i.e., achieves the optimal MAC DMT d* ac (r). x is taken 
from a (sequence of) full-rate linear (lattice) code(s) X r = 
X r< i ® • • • ® X t> k, where X r<i = A rji (T 7 Z r j C K 2T is 
the corresponding lattice code for the ith user consisting of 
those elements of the rank 2T lattice A r l that lie inside the 
shaping region 7 Z r i, which is properly chosen to meet the 
rate requirement \X r> i\ = 2 RT as well as the average power 
constraint. The region 7 Z r> i is a compact convex subset of ]R 2T . 
Specifically, we set A r; , := SNR 5 A. ; , to be a scaled lattice of 
another lattice A,; whose generator matrix is denoted by G,. 
Thus for G = diag(G'i,..., Gk)~ the overall codeword is 
given by x = SNR 2 Gs for some s £ X 2KT . Substituting 
this into ([5]) yields the following equivalent channel input- 
output relation which will be used for sphere decoding of s 

y = Ms + w (7) 

where M := SNR^iTG. 

This joint ML (or lattice) decoder is implemented as 
a bounded-search sphere decoder (SD). For SD algorithm 
details, readers are referred to (L|, j2), [26] however for 
clarity of exposition, wherever necessary, essential details are 
provided during the complexity analysis. Implicit to the use 
of a sphere decoder is a chosen search radius S , a chosen 
decoding order corresponding to an order with which symbols 
of s are decoded, and a time-out policy that terminates the 
decoder once the algorithm exceeds a certain computational 
threshold. The termination policies that we use will be clarified 
depending on the setting. Our results will optimize over S by 
setting S := yj z log SNR, for a properly chosen z > 0. The 
idea here is that the search radius should be big enough so 


that — loosely speaking — the probability that AWGN noise 
has a norm larger than this radius, is sufficiently smaller than 
the probability of error under bmte-force ML. At the same 
time, this radius needs to remain sufficiently small so that the 
number of elements within the search sphere is small enough 
most of the time. Finally, the bounds will hold irrespective 
of the decoding order. As a result, we will henceforth limit 
reference to the search radius and the decoding order, mainly 
in proofs. As suggested before, the derivations focus on ML- 
based decoding, but given the aforementioned performance- 
and-complexity equivalence between ML and regularized lat¬ 
tice based decoding (2), these same results extend automati¬ 
cally to the latter. 

We finally note that the validity of the presented bounds 
depends on the existence of actual coding schemes that meet 
them, and which will be identified here, together with the 
associated SD implementation and halting policies. Regarding 
the codes, we hasten here to say that all rate-reliability results 
can be achieved with uncoded QAM transmission for any n r . 
This is one of the crucial contributions of this paper, and it is 
presented in Theorem [2] 


II. Performance-Complexity Tradeoff for MAC 
ML Decoding 


We first provide an upper bound on the complexity exponent 
that guarantees a certain MAC-DMT performance d(r). 

Theorem 1: For the A'-user MAC, the minimum over all 
ML-based decoders (all SD implementations, all halting and 
all decoding order policies) complexity exponent c, miC j/(r) 
required to achieve a certain DMT d(r) < d^ ac (r), is upper 
bounded by 


Umac ,d(f ) 


sup (K — n r )rT 

)J,&B{r) 

+ r sr=i ( r - (! ~ Ai) + ) + > if K > n r , 
sup T [min {r, r + m - 1}] + , 

At eB(r) 

if K < n r , 

( 8 ) 


where y = \pi ■ ■ ■ p, u ] T , v = min {K,n r } and 


B(r) 



hi > • • • > 0 < pi £ M 

Yh =i (\ K - n r \ + 2i - 1) fjLi < d(r) 


Proof: See Appendix A. ■ 

Remark 1: A remark is in order, regarding the high com¬ 
plexity present in the under-determined MAC case of n r < K, 
and particularly regarding the term {K — n r )rT appearing 
in ([S} (and the same in ([9]) appearing later for T = 1), 
which results in an increased complexity exponent, irrespective 
of the desired diversity performance. To gain some insight 
on this, we quickly recall that a sphere decoder performs a 
QR decomposition of matrix M in 0, which though for 
n r < K, results in a matrix if that is an upper trapezoid 
matrix, whose bottom row contains 2T(K — n r ) + 1 nonzero 
entries. Therefore, prior to processing the root node of a 
sphere-decoding tree, the sphere decoder must first search 
exhaustively among N 2T i K - n r -) combinations of the iV-ary 
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PAM constellation points of the integer entries of s (cf. 0). 
For our particular case of having single antenna transmitters, 
N suffices to be chosen as N = SNR 5 , which explains the 
term (K — n r )rT in l[8}. 


A. Complexity cost of achieving the optimal MAC-DMT 


We here seek to derive the complexity exponent needed to 
achieve the MAC-DMT optimal df Llc (r) (cf. Q). To answer 
this question, we will need to understand the coding duration 
requirement T for such DMT optimality. Towards this, we 
recall that a first step towards coding that achieves the optimal 
DMT performance, was taken in [;5J, which showed that for 
the lightly-loaded multiplexing-gain region r < , uncoded 

(QAM) transmission (T = 1) is in fact DMT optimal. The 
question of deriving codes that can achieve DMT optimality 
for all r, was resolved in GD-GD. which provided — 
depending on the values of K and n r — different DMT 
optimal lattice coding schemes by encoding over time, thus 
requiring T > 1. 

The following theorem plays a crucial role in tightening the 
complexity exponent, by proving that uncoded transmission 
(T = 1, QAM) is indeed MAC-DMT optimal for all multiplex¬ 
ing gains. The following result holds for all isotropic channel 
probability distributions, and it extends the aforementioned 
result in G3-(T7), HD to all r, all n r and all K. 

Theorem 2: Uncoded QAM for each user achieves DMT 
optimality in the K'-user symmetric SIMO MAC for all 
r, K , n r . 

Proof: See Appendix |B| ■ 

We can now apply Theorem [T] to provide upper bounds 
on the complexity exponent c^ ac (r) that guarantees the DMT 
optimal d~ n . iC (r). The bound is constructive and is optimized 
over all known ML-based decoders, and over all existing 
codes. The result holds for i.i.d. Rayleigh fading statistics 
and for any decoding order policy. It corresponds to a DMT 
optimal transceiver that employs QAM transmission with 
T = 1, a search radius S > i/ ac (r) log S NR, and a decoding 
halting policy that naturally halts decoding at SNR Cmac ^ flops. 

Theorem 3: For the X-user MAC, the minimum over all 
codes and all ML-based decoders (all SD implementations, all 
halting and all decoding order policies) complexity exponent 
C mac( r ) required to achieve the optimal DMT d^ ac (r), is upper 
bounded by 


sup (K - n r )r + X^ =1 ( r ~ ( l ~ Ti) + ) + > 

if K > n r 


t-mac (t ) — * 


SU P EEi I min {A r + Pi - 1}] + , 

/xSB(r) 

if K < n r , 


where p = [pi ■ ■ ■ p u \ T , v = min {A”, n r } and 


(9) 


B(r) 



Pi > • • • > /A, 0 < pi G M 

EEt (\ K - n r\ + 2i-l) Pi < d* mac {r) 


Proof: It is a direct consequence of Theorems □ and [2] 


The following corollaries provide explicit expressions for 
Cmac(r') for the single-input single-output (SISO) case of n r = 
1, and for n r = I\. 

Corollary 4: For the K -user SISO MAC (n T = 1), the 
complexity exponent required to achieve the optimal df vdc (r), 
is upper bounded by 

CmacM = (K ~ 1 )r, for 0 <r (10) 

A 

Corollary 5: For the K -user MAC with n r = K, the 
complexity exponent required to achieve the optimal df. ic (r). 
is upper bounded by 


CmacO) = r v4v(l - r )J 


-t- \r - 1 + 


K( 1 - 

r) - ( 

V K (1 ^ r) 

) 2 

2 

V K ( 1 - r) 

+ 1 


( 11 ) 


Fig. .0 plots c mac (r) for K = 4,5 and n r = 1, while 
Fig. 0 plots c mac (r) for n r = K = 3,4,5. In interpreting 
the plots below, we recall that the here maximum achievable 
multiplexing gain is r jf. 

Remark 2: As we see from ( fTT) and Fig. [2| when n r = K, 
the complexity exponent c(r) is not monotonically increasing. 
This is simply because an increasing r might indeed increase 
the objective function in the maximization, but at the same 
time it also decreases d(r) and thus also decreases the volume 
of B(r) over which maximization takes place. 



Fig. 1. Bound on the complexity exponent guaranteeing DMT optimality in 
the X-user SISO MAC: K = 4, 5 and n r = 1. 


B. The benefit of increasing the number of receive antennas 

In the following we show the rather surprising result that a 
substantial increase in the number n r of receive antennas, al¬ 
lows for maximal reductions in the complexity exponent. This 
is indeed surprising because, as n r increases, a MAC-DMT 
optimal decoder must handle channels that are increasingly 
more singular Q and thus typically harder to decode because 
near-singular channels typically result in denser signaling 
constellations at the receiver. 

4 Recall from that for n r > K the optimal MAC DMT is d'( n . dC (r) = 
n r (1 — r)+. 
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Fig. 2. Bound on the complexity exponent guaranteeing DMT optimality in 
the iT-user SIMO MAC: K = 3,4, 5 and n r = K. 

Corollary 6: For the A'-user SIMO MAC with n r 3A K, 
the complexity exponent for achieving the optimal MAC-DMT 
^mac( r )’ approaches 

^(r)=-in{r, nr *~ 1 +1 (l —r)| 

and in the limit of asymptotically large n r , the complexity 
exponent tends to zero. 

Proof: See Appendix [C] ■ 

In Fig. [3j a) we plot the complexity-exponent upper bounds 
Cmac(f) for K = 3 users and for various values of n r . The 
corresponding optimal diversity gains df lc (r) are shown in 
Fig. [3jb). It can be easily seen that the bounds c mac (r) (with r 
fixed) decrease monotonically for an increasing n r , and in the 
limit of very large n r , the complexity exponent approaches 0, 
meaning that the decoder V r — with the proper halting policy 

— can deliver the substantial diversity performance n r ( 1 — r) 
in a computationally inexpensive manner. 

III. Performance and Complexity for MAC 
ML-based Decoding with Feedback-Aided User 
Selection 

We now explore the ramifications of feedback-aided user- 
selection on the performance and complexity of ML-based 
MAC communications. Our motivation in exploring user se¬ 
lection comes from the associated gains in MAC reliability 
which — particularly for the high multiplexing-gain region 

— is typically quite low. This same method simultaneously 
allows for a “simplification” of the communication problem, 
from a larger multiple access channel, to a selectively reduced 
smaller and more manageable setting. Finally, this ability to 
provide simpler and more reliable communications, comes at 
a very reasonable feedback cost, which renders user selection 
applicable in the presence of limited feedback links. 

We will here consider a MAC user-selection scheme, which 
is modified from the Jiang and Varanasi (JV) antenna selection 
algorithm in {25[ . The JV algorithm, to the best of our 
knowledge, is currently the best performing antenna-selection 
algorithm for the point-to-point (single-pair) MIMO scenario. 




Fig. 3. Complexity-exponent upper bound e mac (r) (see subfigure (a)) for 
achieving rfm a c( r ) (see subfigure (b)) for K = 3 and for increasing n r . 

The proposed user selection scheme, to be described in more 
detail in Appendix [Dj selects which L out of K users will 
transmit throughout the coherence period of the channel, 
and then informs all users of the selection outcome via a 
feedback channel using log 2 (j) bits per channel coherence 
period. This selection decision is taken as a function of the 
entries of a matrix Rjy derived from the QR-decomposition 
JT eq II = Qjv-Rjv where II is a permutation matrix, where 
Qjv is a unitary matrix, and where — depending on the values 
of I\, A, and n r — i£ JV can be either upper triangular or upper 
trapezoidal. This distinction in the shape of Rjy will here limit 
our consideration of L to an allowable set 

£:={ 1,2,-• - , u, K} 

where we recall that v = inin{ K. n r }, and where L = I\ 
corresponds to the case where no users are pruned out and thus 
where all K users transmit simultaneously to the receiver. Such 
choice of L = K can, especially at high values of multiplexing 
gains, provide for the highest reliability. 

We proceed to bound the DMT gains of user selection, and 
— based on these bounds — to then show that any such 
gains can come with a simultaneous exponentially-reduced 
complexity, compared to the complexity associated to c^ ac (r) 
in Theorem [3 

Remark 3: We note that under the adopted assumption that 
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the fading statistics are uniform across users, then the user 
selection algorithm will in fact ‘prune all users (statistically) 
evenly and thus ‘fairly. It is important to note that the 
rate considered here, does indeed account for the periods of 
‘silence of a user that has not been selected. In other words, 
the derived diversity-multiplexing expressions — which are 
the same for all users — correspond to a user rate that is 
simply the total number of bits sent by each user, divided by 
the total number of time-slots including the time-slots during 
which the user was kept silent by the selection algorithm. 


A. Selection-aided MAC DMT bounds, for a fixed L 


We first proceed with an upper bound on the selection-aided 
DMT dus.rJr) for the employed user selection scheme, when 
the number of selected users is fixed to a certain L. 

Theorem 7: Given L, and given the JV-inspired user selec¬ 
tion scheme, the MAC DMT is upper bounded as 


Jis.lM < d UStL (r) 


min d k ,e 

k>0,,£>l ’ 

k+e.<L 



( 12 ) 


where 


dkA r ) '■= inf D k,l{ot), 

Ae(r) 

Ae(r) := jo < aq < • • • < at : ^(1 ~ a *) + — r | 

and 

i t -1 

D k} e(a) := ^(n r + £-2i + l)aj + ^(iT - k - £)a t 

i=l i—1 

+ a.(.{K — k — £)(n r — k — t + 1). (13) 

Proof: See Appendix [E] ■ 

We here note that it is not hard to show when L = K, that 

dm,K (r) = d^ c (r) = d us , K (r). (14) 

Also when n r = 1, it is easy to show that 

<4s,i(r) = J us ,i(r). (15) 

Hence the bound is tight for L = 1 and L = K. 

In Fig. Hi we plot the DMT upper bounds J L1S ./,(?') from 
Theorem|7j for the (under-determined) case of K = 4 ,n r = 3, 
with L = 1,2,3. In the same figure we compare the above 
dus,L (f) to the optimal MAC DMT corresponding to 

having no selection (or equivalently to having L = K = 4). 
Fig. [5] plots the DMT bounds for the (over-determined) case 
of K = 3, n r = 4, with L = 1, 2, 3. In interpreting the figures, 
one must recall that, due to the fact that each user is selected 
with probability r f, maintaining an average multiplexing gain 
r, will require that each transmitting user communicates at a 
multiplexing gain of j. r . This is indeed taken into account, 
and the multiplexing gain reflects the true rate of communi¬ 
cation, in full consideration of the fact that part of the time, 
some users are not transmitting. It should also be noted that for 
L < v, the maximal achievable multiplexing gain is jf or each 
user. Furthermore, it can be seen from the curves in Fig. [4] and 


Fig. [5] that the DMT performance of proposed feedback-aided 
user-selection scheme strongly depends on the choice of L for 
different regions of multiplexing gain values r. For instance, 
in Fig. [5] of K = 3 and n r = 4, choosing L = 1 yields the 
largest diversity gain whenever r £ (0, 0.2]. For r £ (0.2, 0.5], 
L = 2 becomes the best choice, and if the desired multiplexing 
gain r £ (0.5,1], one should set L = K = 3. A similar 
observation can be made in Fig. [4] These cross-points between 
curves ci U s,L(f) for various L suggest further diversity gains 
by optimizing over L as a function of r. This will be discussed 
in more detail in Section 1III-CI 

Another interesting observation from Fig. [4] and Fig. [5] is 
that despite the different configurations of K and n r in these 
figures, the respective initial values d us ,L(0) are the same for 
L = 1,2,3. This is actually no coincidence. To see it, note that 
as multiplexing gain r approaches zero from the right, meaning 
that the desired transmission rate R is arbitrarily close to zero, 
the only way for the communication system being in outage 
is exactly when all the major channel links are in deep-fade, 
i.e., when aq = ... = = 1 in the set In this case, 

the function Dk/(a = 1) simplifies to 

DkA 1) = Kn r — k(K + n r ) + k 2 + k£, 

hence the minimal £>^(1) occurs at £ = 1 and 

J us l( 0) = min Kn r — k(K + n r ) + k 2 + k. 

’ 0<fc<L-l 

This shows that the value of d u Sj l( 0) is symmetric between 
K and n r whenever L < min {K,n r }. Moreover, with K = 
4 and n r = 3 (and the same holds for K = 3 and n r = 
4) we have J us ,l( 0) = min 0 <fe<L_i k 2 — 6k + 12, and it 
is straightforward to see that d USj i(0) = 12, d us , 2 ( 0 ) = 7, 
Jus, 3 ( 0 ) = 4, and J USi 4(0) = 3 as shown in Fig. |d] 



Fig. 4. Selection-aided DMT upper bounds d us ijr), vs. the optimal MAC 
DMT d^ ac (r) without selection. K = 4 , n r = 3, and L = 1,2, 3. 


B. Selection-aided MAC complexity bounds, for a fixed L 

We now use the previous DMT bound to upper bound 
the complexity exponent c us ^(r) required to achieve this 
selection-aided DMT d^jfrj corresponding to the proposed 
/,-user selection algorithm, for any fixed L £ C. The result 
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Fig. 5. Selection-aided DMT upper bounds d USj i,(r), vs. the optimal MAC 
DMT djj ac (r) without selection. K = 3, n r = 4, and L = 1,2. 


is a direct consequence of Theorem [3] and it holds for i.i.d. 
Rayleigh fading channel statistics. 

Theorem 8: For the A'-user MAC, the complexity exponent 
Cus,i(r) required to achieve the selection-aided DMT d us ,L(r), 
for a given L <v, is upper bounded as 


Cus,i(r) < c USii (r) 


where 



K I< 

T r, T T + n -l 



(16) 


cti < • • • < «l, 0 < ai £ R. 1 
D 0 ,l(o.) < d USjL (r) J 

and where Dk/(ot) is defined in ( fl3| . The performance- 
complexity pair (d USi z,(r), c us ,i(r)) is achieved with uncoded 
QAM (T = 1), a sphere decoder with a search radius S > 
d a s,L{t) log SNR, any decoding ordering, and a decoding 
halting policy that halts decoding if N max (r) = SNR Cu!,z '( r \ 



Proof: The result holds directly from Theorem [3] and 
from the fact that the complexity needed to achieve d us ^ (r) is 
upper bounded by the complexity needed to achieve d us > 


c U s,i*(f) as a bound for the complexity required to achieve 
the optimized selection-aided DMT maxi £ £{c! US: t(r)}. 

For the under-determined case of K = 4, n r = 3, Fig. [6] 
plots L*(r), while Fig. [Tjplots the optimized, over L, selection- 
aided DMT upper bound d USj L*(r). This bound is also com¬ 
pared to the bound d USj L»* (r) where 

L**(r) = argmax{J US) i(r)} (19) 

L<v 

corresponding to the case where selection always happens, i.e., 
corresponding to the case where we do not allow for L = K, 
even if L* (r) = I\. 

Fig. [8] plots the complexity exponent upper bound c USi l» (r) 
that guarantees the optimal — over all L — selection-aided 
DMT d usJ ., (r). This complexity exponent bound is com¬ 
pared to c mac (r) that achieves without user selection. 

The same figure also gives an interesting comparison to the 
complexity exponent c U s,l**(0 which, in conjunction with 
<7in Fig- [t] suggests that the choice of L**(r) — 
rather than L*(r) — can provide substantial gains in com¬ 
plexity, with possibly moderate losses in performance. 

4 ---T-,--,-,-,- 

3.5 

3 - 

2.5 

£ 2 - 

1.5 

1- 

0.5 

°0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 

Multiplexing gain r 


Fig. 6. Optimal number of selected users L* ( r ) for K = 4-user MAC with 
n r = 3. 


Xr). 


C. Bound-based performance and complexity optimization 
over L 

The performance and complexity bounds in Theorems [7] and 
[8] suggest gains by optimizing over L as a function of r. Using 
the bounds as an indicator to the best choice of L, we let 

L*(r) = argmax{d USi i(r)}, (17) 

where we recall that C = {1,2,... ,v, K}. Hence 

7 us ,i*(r) = max{J USjL (r)} (18) 



can serve as an upper bound on the selection-aided DMT 
max ie £{4,L(r)} maximized — at any given r — over the 
choices of A, including over the choice L = K that all users 
are selected. As a result, going back to we can calculate 


Fig. 7. DMT upper bounds d us l* (r) for user selection, compared to d^ ac (r) 
without selection, and d us £,** (r) when user-selection is enforced, for the 
K = 4-user MAC with n r = 3. 



























Fig. B. Complexity exponent upper bounds c US] /.+ (r) for user selection, 
compared to c m ac(r) without selection, and compared to c us /,+■ (r) corre¬ 
sponding to when user-selection is enforced. Plots for K = 4 and n r = 3. 


D. Selection-aided complexity reduction, at no performance 
costs 

We here explore the particular case where user selection is 
calibrated to reduce complexity, while maintaining reliability. 
The following quantifies this approach, based on the derived 
bounds. A crucial element in this effort is that any gains 
in the selection-aided DMT compared to the original MAC- 
DMT dy l)nc (r), can allow us to effectively ‘rest’ the decoder 
often enough to reduce the complexity exponent, without 
falling below the required d^ ac (r). In this case, the resulting 
complexity exponent is upper bounded by 


Cred-us,Jt ( f ) 


sup Ef=i [min {f r, fr + a,-l}] + , 

ae IFred-us.i (r) 

if d us , L (r) > < ac (r), 


z(r), 


if otherwise, 

( 20 ) 


where 


red-US, L (c) 


oil < Ct2 < ' ' ' < CtL, 0 < OLi £ K. 
D 0 , L {a) < d* mac (r) 


and where the second case in ( |20| ) refers to the situation where 
L* = K. Hence an upper bound on the associated complexity 
exponent, minimized over L , is given by 


Cred-us(r) = minc re d-us,i(r). 


( 21 ) 


Fig. [9] explores this for the case of K = 4 and n r = 3, pre¬ 
senting the resulting upper bounds c,cd-us.f, (r) for all possible 
L = 1,2,3. The optimized complexity reduction, under the 
condition that we do not reduce reliability below dJnac( r )’ ' s 
given in Fig. 10 The striking observation is that at low or 
moderate multiplexing gain regimes, the optimal d^ ac (r) can 
be achieved with a complexity exponent that can be as small 
as 0, i.e., with computational complexity that does not scale 
exponentially in the total number of codeword bits. At high 
multiplexing gains, no complexity reduction is available since 
L* = K. 


Fig. 9. Complexity exponent bounds c re d-us,i(r) with L = 1,2,3, all 
guaranteeing d* IAC (r) performance for K = 4 ,and n r = 3. 



Fig. 10. Complexity exponent bounds Cred-usM, Cus £* (r) and c mac (r), all 
guaranteeing d* iac (r) performance for K = 4, n r = 3. 


IV. Conclusions 

In this paper we derived bounds on the performance and 
complexity behavior of ML-based (and lattice-based) decod¬ 
ing, for the multiuser multiple access channel. Emphasis on 
ML-based decoding was motivated by the fact that MAC- 
related settings have distinctively reduced reliability, which 
hinders the use of low-performance decoders that would 
further reduce this reliability. 

The derived complexity-vs-performance bounds were pre¬ 
sented in the form of diversity and complexity exponents, 
and can provide insight on how to tradeoff performance and 
complexity in such outage-limited multiuser settings. The 
analysis shows that, under the requirement of efficient ML- 
based decoding, this tradeoff is not trivial; indeed we see that 
complexity constraints can crucially deteriorate performance. 

The derived bounds also suggest substantial reliability and 
complexity benefits by increasing the number of receive anten¬ 
nas, as well as substantial benefits by utilizing just a few bits of 
feedback to allow for user selection. User selection is indeed 
pertinent as it can inherently increase reliability through chan¬ 
nel selection, as well as can inherently decrease complexity 
by simply and selectively reducing the size of the problem at 
hand. These gains are highly sought in multiuser settings like 
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the MAC, which typically suffer from reduced reliability and 
increased complexity of decoding. In this context the analysis 
reveals the interesting finding that proper calibration of user 
selection can reduce the complexity exponent of near-optimal 
ML-based decoding, down to zero, thus revealing that — for 
a substantial range of multiplexing gains — the computational 
complexity of near-optimal ML-based decoding, need not 
scale exponentially in the total number of codeword bits. 


Appendix A 
Proof of TheoremQ] 

In the following we establish an upper bound on the min¬ 
imum complexity exponent c maCi d(r) required by ML-based 
decoding to achieve a MAC DMT performance d{r). 

We recall from (0 that the sphere decoder ‘sees’ a channel 
model 

y = Ms + w (22) 

1 —r 

where M = SNR^ - HG, where H is the equivalent channel 
in and where G is the generator matrix of the overall 
(product) lattice code. To allow for sphere decoding in the 
under-determined (K > n r ), MMSE-preprocessing (cf. (26)) 
gives the MMSE-preprocessed code-channel matrix 


M = 


M 

Otl u 


QR 


(23) 


where I u is the uxu identity matrix, where u = 2 (K — n r )T, 
and where a = SNR -3 . If K < n r , then simply a = u = 0. 
After the QR decomposition in ( |23| , we get 

r := M^y = Rs + w' 


where w' = —a 2 s + M^w. For 

S r := js G Z 2KT : SNR -3 Gs G X r j 

being the set of points in the product lattice A that constitute 
X r after scaling, the sphere decoder takes the form 

Smmse-sd = argmin ||r — Rs \\ 2 (24) 

and it recursively enumerates all candidate vectors s G S r 
within a given search sphere of radius 5 = y/z log SNR for 
some z > d(r). 

To compute an upper bound on the complexity exponent, 
we follow the approach similar to (2|. Towards this, we first 
let 


A i = oi ( H e q ), i = 1,..., v = min{A', n r } 

be the nonzero singular values of T/j q H cq , arranged in ascend¬ 
ing order, and then we let 


log A j 

log SNR' 


(25) 


Hence we can write 


a.,(R) = <Ji(M) = a 2 + i = 1,..., 2KT. 

(26) 


For K < n r , we have 

a t (R) = SNR’ 
and for K > n r we have 


(l-r-i -.) 

I o T 1 


Vi(R) = 


SNR - 5, if 1 < i < 2 T(K - n r ) 1 

SNR - 5 + 5 (1-Mi)+ , otherwise J 

(27) 

where j = ^ nr ) ~\. In the above we have used that 

cri(G) = SNR 0 , for all i. 

We now see that for any given channel realization, corre¬ 
sponding to a specific y, := [y\ - ■ ■ /i;,] T (cf. <[25]»), the total 
number of visited nodes is given by 

2KT 

N S M := £ JV fc (/x) 


k =1 


where, drawing from (!] Lemma 1], we can see that 


m ») < n 

and thus that 


i=l L 


Vk + 2 mil 


o-i(R) 


2 KT k 


Vk + 2 mir 


a,{R) 


, VkSNRi 


VkSNRi 


-^sd(m) < En 

k—l i= 1 L 

Thus for the overdetermined case (cf. ©), we have 
AWm) < SNR t [ min { r . r ’+Mi - 1 }J + 

while for the underdetermined case (cf. we have 

JVsd(/*) < SNR ( - ftr-n ’- )rT+ ' rE -i( r-(1-/i * )+ ) + 


(28) 


(29) 


where we have used the 2T-fold multiplicity of the singular 
values of H. 

At this point, in the same spirit as in 0, the upper bound on 
the complexity exponent can be obtained as the solution to a 
constrained minimization problem of finding a value c mac d(r) 
such that the probability of a premature termination of SD 
algorithm is no larger than the channel outage probability, i.e., 

Pr {n s M > N max (r ) = SNR C “^«} < SNR -d «. 

(30) 

This is the optimization reflected in the complexity exponent 
bound of Theorem [I] ■ 


Appendix B 

Proof of Theorem[2} Optimality of Uncoded QAM 
For T = 1, the rank-2 A' overall (product) code-lattice A 
is isomorphic to the rectangular lattice Z 2A , and the overall 
(product) code 

X r = jsNR - Sa; : a; G (Z[i]) K ,|^| 2 < SNR r | 

is essentially uncoded (scaled) QAM. To show that the above 
code X r satisfies the probabilistic complexity constraint ( [30] ), 
we follow the footsteps in tig, gd and consider a A'-fold 
extension of X r 

K 

a;, ext = 0A r C SNR -^-M k (Z[i]) 

i—1 
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where M X (Z[*]) denotes the set of K x K matrices with 
entries from Z[ ?. ]. In other words, elements of X rcxl are square 
I\ x K matrices whose entries are independent QAM constel¬ 
lation points after a scaling of SNR -5 . Naturally, for any given 
decoder, X r ext and X r achieve the same DMT performance. 
Under brute force ML decoding, the error probability of X rcxl 
is upper bounded by 

^e,ext (t*) 

= EPrjX' £ £{X) decoded} 

r K 

= EPr k (X) decoded} 

lfc=i 

(i) K 

< ^^SPrlX' £ £ k [X) decoded} 

fe =1 

<X> Pr( U 

fc=i [x'e£ fc (x) 



where the expectation is taken over all codeword ma¬ 
trices X £ AV.exi, where £(X) := A^ext \ {X} is 

the set of all possible erroneous decoded outputs given 
that codeword matrix X is transmitted, where £ k (X) := 
{X' £ £(X) : rank(X - X') = k) with k = 
forms a partition of £{X), where step (i) follows from the 
union bound, and where step (ii) is due to the use of a 
suboptimal bounded distance decoder (cf. 03)- 
For any X' £ £ k {X), set A x - = X - X' and let 
A X ,A^, = Ux.Vx 'U^ x , be the corresponding eigen- 
decomposition. Note that as rank(A X ') = k, the eigenvalue 
matrix S X ' has form S X ' = diag(L2 X ', Ox-k), where 
the diagonal of J7 X / consists of all nonzero eigenvalues of 
A X 'A^,. Substituting the above into (|3T|), we obtain 




where G k is an (n r x k) random matrix with i.i.d. CM (0,1) 
entries. Let Ai < • • • < X m be the ordered nonzero 

eigenvalues of G\G k , where m = min{/c,n r }. Noting 
that det(fi X ') > SNR -fcr and that Tr(fi X ')<l, it can be 
shown — drawing from 03 — that the condition of 
Tr^G[.n X 'Gfc) <1 implies that ~ f t i) + < kr, 

which is independent of the choice of X 1 . Hence we have 



= Pr < ft : ^(1 - m) + < kr 
l i=l 

= SNR _d *.»r (fcr ) 

where the last exponential equality follows from (6]|. Finally, 
note that the error probability of X r , subject to (unterminated) 
joint ML decoding, is upper bounded by 

1 1 K 

Pe{r) < 77^,ex t(r)<-VSNR-^W = SNR-^-W 
A A i ' 

fc=l 

which completes the proof. ■ 


Appendix C 

Proof of Corollary[6] 

First let us recall that for n r > K, then v = iniu{ K, n r } = 
I \. Let us also recall from |6) that the joint probability density 
function of fi = [fj,i ■ ■ ■ (cf. ©) satisfies 

p(/x) = SNR - E)£i ( «r-A+2i- 1) /ii 


provided that Hi > 0 for all i. As a result, for each 
/x € (WX) K , the corresponding complexity exponent is upper 
bounded by 

K 

c (h) = [minjr, r + m - 1}] + (33) 

1=1 

which is not a function of n r . At this point, let us consider a 
decoder that decodes only when 

K 

- K + 2i- 1 )/j,i < d* l nr (r) 

i =1 

while when (n r — K + 2i — 1)/Xj > d\ nr {r) the 

decoder simply declares an error. Since, for n r > K, d\ n (r) 
dominates the MAC DMT (cf. @) , the extra declared errors 
do not affect the overall diversity performance. 

Now we note that for any > ■ ■ ■ > ft k > 0 such that 
Yli(. n r — X + 2i — 1 )ni = n r ( 1 — r), we have that 

y^(n r — K + l)ni < n r (1 — r) < ^^(n r — K + 2K — 1 )/Zj, 


and thus that 

n r 

n r + K — 1 


(1 - r) < V m < 


n r — K + 1 


(1 -r) (34) 


which, for n r X> K, implies that 


n r 

n r + K — 1 


(1-r) 


n r 

n r — K + 1 


(l-r) 


(1-r). 


Subject to the constraint in ( [34] ), we can see that the maximum 
value of c(/r) in ( |33| ), is achieved by setting „ ™k+i (^~ 
r) and /i 2 = • • • = I^k = 0, where this maximal value takes 
the form 


< Pr \ 1J j H - Mi) + < kr ( 

x’e£ k (x) 


i=1 


K 

Y, [min{r, r + /i* - 1}] + 

i= 1 


I< - 1 
n r — K + 1 


(1-r) 
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Hence for n r K , 


K 

Cmac(r) = sup ^ [min{r,r + m» - 1}] + 

MSB(r) i=1 

K 

< sup ^ [min{r ,r + fj,i — 1}] + (35) 

M6 B(r) i=1 

K - 1 , . r 


= mm < r. 


where 


, _ , Mi > • • • > Mif. 0 < ^ G K, 

^ 1 M 1 ZZi ( K ~n r + 2i-l)^< n r ( 1 - r)+ 


-Rjv = Ql ''' Qi-Heqlli • • • Ili. The users associated to the 
first L columns of /?, lv are the selected ones f] 

Appendix E 
Proof of Theorem^ 

Recalling from the description in Appendix [D] of the se¬ 
lection algorithm, to select the L users, there are L iterations 
of applying column permutation matrix IT, and Householder 
transformation Q l to obtain an output matrix /i lv , which is 
of the form 

ftjv = Ql ' ■ Ql-Heqlli • • TIl 

n i * • • • * * * 

7*2 2 • * * * * • * * * 


and 


B(r) 



Mi > ••• > M*r,0 < Mi € K 1 
E^iMi<H^fe+r(l-r)+ /• 


The inequality in ( |35j ) is due to the fact that B(r) C B(r), 
while note that equality holds when /ii = n ™j <+1 (1 — 
r), M 2 = • • • = Mif = 0, which in turn implies that 


Cmac(r) = min < r, 


K - 1 
n r — K + 1 


(1-r) 




fi,L * ’ ’ ’ * 

* • • • * 

Let m be the user associated with the /th column of Rjy, and 
let {ui,U 2 , ■ ■ ■ ,ul} be the set of selected users. Since entries 
of H eq are i.i.d. CW(0,1), to meet an average multiplexing 
gain r for each user, the selected user has to transmit at 
a possibly larger multiplexing gain of For the set of 
‘modified’ outage events 


which establishes the corollary. ■ 

Appendix D 

Description of User-Selection scheme 

The proposed user selection algorithm can be drawn from 
|25;|, by restricting selection to only transmit antennas (no 
selection of received antennas). In the setting with K single¬ 
antenna users, H eq (cf. ©) has as fcth column the vector 
h/- corresponding to the channel vector of the fcth user. 
The goal is to select L out of K users for transmission 
(L < v = min{/i, n r }). The selection process is closely 
related to the QR-decomposition of H eq using a Householder 
transformation, which exhibits good numerical stability. The 
selection takes L iterations, and each iteration consists of two 
steps, a matrix-column permutation followed by a Householder 
transformation. At the first iteration, the algorithm begins 
by finding the column hj 1 of H eq with the largest column 
norm. Then it right-multiplies H eq by a permutation matrix 
Hu to swap hi and h f t . The second step is to apply a 
unitary Householder transformation Q 1 to iT cq IT| such that 
the top-left entry of Q 1 H eq Tli is the only nonzero (and 
positive) entry in the first column. After finishing with the 
first iteration, the algorithm shifts its focus to the trailing 
(n r — 1) x (K — 1) submatrix Hi of Q 1 H eq TLi. Similar to 
the first iteration, the algorithm identifies the column with the 
largest norm in Hi, swaps it with the first column, and then 
applies the Householder transformation. Thus, at the end of 
this iteration, the channel matrix becomes Q 2 QiH eq niH 2 . 
The third iteration is the same as the previous ones but focuses 
on the trailing (n r — 2) x ( I\ — 2) submatrix. The same 
process is repeated L times, resulting in an output matrix 


Ok./ := {the sum-rate of users Uk+i, ■ ■ • ,Uk+e is in outage} 

(36) 

for all k > 0, t > 1, and k + £ < L, we will see later on — in 
the process of the proof — that the outage event considered 
by Jiang and Varanasi 125 Theorem 4.1], is a special case of 
the above events corresponding to k = 0 and £ = L, i.e., a 
special case of the outage event Oq.l- 


A. DMT Analysis for error event Ok,t 

To analyze the DMT for the error event Ok ,e for any k > 0, 
£ > 1, and k + £ < L, let Rk.t be the matrix resulting from 
applying (fc + £) iterations of the Jiang-Varanasi algorithm to 
overall matrix H eq . We partition matrix Rk.t: as follows: 


where 


Rk,t — Qk+l ’ ’ ’ Ql^IeqUl * * ' 

Rl Rc,U Rr,U 
= Rc,B Rr,M 

Rr,B 


r i,i • • 

• ri t k 



Tk,k 


r l,k+\ 

fi ,k+e 

fk,k+1 

fk,k+t 


(37) 


It is worth mentioning that if K = n r = L, then Rj\i is an upper 
triangular matrix, and thus H eq IIi 11^ has a QR-decomposition equal to 
(Ql '' ' Qi^Rjv. Also, as the selection focuses only on a trailing submatrix 
of Q rn _ i • • • Q x .HeqTIi • • • Qm—i during the ra-th iteration, one cannot say 
that the zth column of Hjy has the zth largest norm among all columns, for 
i = 2 ,-,L. 
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r fc+l,fc+l ’ 

fl.fe+i+l 

Pfc.fc-hM-l 
Uc+l,fc-M+l 

rk+e,k+e +1 

Ut-M+iA+f+i 
■Rfl.S = ; 

UvA+^+1 

and where the entries satisfy 


Rc,b = 


Rr,u = 


Rr,M = 


fk+l,k+i 

r k +e, k +e 
n,K 

fk,K 
r k +i,K 

rk+e,K 

r k+t+l,K 

r n r ,K 


> E 


r m,j | i 


(38) 


for i = 1, 2, ..., k + £ and for j = i + 1,..., K. 
For 

Rc,u 


Rc = 


R< 


C.B 


the probability that the sum-rate of users u k + i,--- ,Uk+e 
results in an outage, takes the form 


Pr{0 M } 

= Pr { logdet (I nr + SNR R c H,t ) < ^-£r log SNR 


I< 

T 


!x(v) 


k+e. 

IK 

i= 1 


■2(n r -i+l) 


Ti, 


K 

n 

u'=*+l 


a -\nj\ 2 


:= SNR 




: , ,,, - g-II^.BlI 2 dR (39) 

?r ( K—k—Z){ni — k—t) n, ’ c v 7 

(40) 


where c is a constant relating to ordered statistics (cf. the 
first constraint in (|4T|>), where p x 2 (•) is the probability density 
function for a \ 2 random variable with k degrees of freedom 
and with mean and where the integration region is 


> y Ur . 


I[r) := < R k ,t : i = l,...,k + £, j = i + l,...,K l 
[ det (l nr + SNR R c R f c ) < SNR^ £r | 

(41) 

We have the following three remarks which we will jointly 
use later on. 

Remark 4: We first note that the diversity exponent associ¬ 
ated to Pr {Ok/} from ([39}, takes the form 


dk,e 



lim 

SNR—^00 


log Pr {Ok/s 

log SNR 


inf 


D (R k4 ) 


where the last equality is a manifestation of Laplace’s princi¬ 
ple |6j|, and where D (R k ,e) is a specific diversity-exponent 
function corresponding to the integrand in ([39}. To understand 
the above, we shall look for sub-events of I(r) that yield the 


smallest possible diversity exponent D (R k ,e). To this end, 
note that the entries in Rf and Rr } u are not involved in 
the second constraint in I(r), i.e., are not involved in the 
constraint det ^ I nr + SNRi?c , i?^ < SNR Tfr . Furthermore 
one can see that smaller absolute values of nonzero entries in 
Ri and Rr_,u correspond to larger values of D {R k /). We 
hence conclude that the dominant sub-event of I(r), and the 
one yielding the smallest diversity exponent D (R k /)), must 
consist of matrices R k ,i for which the nonzero entries of the 
associated submatrices R/ and Rr,u are the most typical 
ones. In other words, the smallest value of D (R k /) must 
result from the case when the nonzero entries of submatrices 
Rj and Rrjj have magnitudes in the order of SNR 0 . 

Remark 5: Let Ai > • • • > A/ be the ordered singular 
values of Rc, and let v\ > ■ ■ ■ > v/ be the ordered singular 
values of Rc.b- Clearly, as Rr i>R c ,b — R (; R' C , we have 

i = l,2,...,L (42) 


Using [|25[ Lemma 3.3], we have 


(Rc,b) u = 


r z > 

k-\-i,k-\-i — 


E = 


3^ ^ 


e-i + i 


(43) 


for i = 1,2,..., i. Moreover, from [25 Eq.(27)] we have that 
the squared diagonal elements in Rc.b are multiplicatively 
majorized by its squared singular values, i.e., that 


m 


m 


IK - n r i+i,k+i 

i =1 i=1 


m 



2—1 


m = 1,2,(44) 


where the second dotted inequality is due to ((43]). It then 
follows that 


r 


k-\-i,k-\-i 


(45) 


for i = 1,2,..., t. 

Remark 6: After the first k iterations of the Jiang-Varanasi 
algorithm, we get 

Qk' ’ Q [ C^cqTll • Ilfc 

ri i * ••• * * * 

7*22 • • • * * • • • * 

ffc.fc * • • • * 


* 


Rl 

0 


Rr 


where Rr is the ( n r x {K — k)) matrix consisting of the 
rightmost (K — k ) columns of the above matrix. In Remark 
[4] we argued that the dominant matrices R k j in event X(r) 

— yielding the smallest possible diversity exponent D (R k j) 

— must have submatrices Rr whose nonzero entries are of 
magnitude that is in the order of SNR 0 . This implies that 
the entries of Rr can still be regarded as i.i.d. CM (0,1) 
random variables since any ensemble of random CM (0,1) 
variables has the same asymptotic probability as those subject 
to an additional constraint of magnitude lesser than SNR 0 . 
Consequently, the ordered singular values A i,i = 1.....C, 
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of matrix Rc are still of the same joint probability density 
function as that for the ordered singular values of an (n r x £) 
random matrix with i.i.d. CAF (0, 1) entries. 

With the above three remarks in place, we now proceed to 
analyze the integral ( |39| to obtain a formula for the DMT 
function dkg{r). Specifically, we will show that 

, t e-i 

dk i(t) = inf < y^(n r + £ — 2i + l)ai + 'S^(K — k — £)cti 

h 
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+ ae(K — k — £)(n r — k — £ + 1) j (46) 

where 

At(r) = L<a 1 <...<a e : ^(1 - cti) + < r 
l i= 1 

To see the above, set Xf = SNR _ “ ; with a\ < (x-i ■ ■ ■ < 
at. The first summand appearing in ( |46| follows from the 
aforementioned fact that the joint probability density function 
of ordered singular values Ai > • • • > Xg for an (n r x £) 
matrix with i.i.d. CAf (0, 1) entries is (cf. |j6j, 1251) 

p(a a t ) = SNR _ ^=i (n ’" M_:M+1)Qi . 

Also by ( [38] ), ( |42| ), and ( [45] ), we have the following constraints 
for the entries in matrix RkX- 

1) For i = 1, 1, the entries I'k+i.j (j = k + £ + 

1,..., K) must satisfy 

|Tfc-H,.7 I — |^fc+i,fc+i| — Aj . 

The constraints on rk+ij contribute to ( [46] ) the second 
summation, i.e. the term Xii=i (K — k — £)ai. 


2) Entries rk+ij with i = £,... ,n r — k and j = k + £ + 

1 ,,K, must satisfy 

n r — k 

'y ] l r fe+i,il — \ r k+e,k+e\ = Mr — A^ 

%=t 

Such constraints contribute to ([46]) the last summand, 
i.e., the term ag(K — k — £)(n r — k — l + 1). 

This completes the proof of ( |46| >. 

Finally, the proof of Theorem [7] is complete after noting that 
the union of outage events Ok,e, is a subset of the overall 
outage event, i.e., that 

U o M c u : users in U are in outage} . 


Remark 7: It is interesting to see that the antenna-selection 
DMT in Jiang and Varanasi ( f25[ Theorem 4.1]), can be 
derived as a special case of Specifically it can be shown 
that when k = 0 and £ = L < v, the corresponding 
coincides with the DMT in |25| , as both functions 
are piecewise linear, connecting the following (P + 2) points 


(r, (K - r)(n r - r)), r = 0,1,..., P, and (L, 0), 

where 

D • {K ~ P){n r ~ P) 

P = arg mm ---. 

p=0,l,...,L-l L—p 


(47) 

(48) 
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